arXiv:1505.00542v2 [cs.IT] 5 May 2015 


A deterministic algorithm for the distance and 
weight distribution of binary nonlinear codes 

Emanuele Bellini, and Massimiliano Sala. 

Abstract 

Given a binary nonlinear code, we provide a deterministic algorithm to compute its weight and distance distribution, and 
in particular its minimum weight and its minimum distance, which takes advantage of fast Fourier techniques. This algorithm’s 
performance is similar to that of hest-known algorithms for the average case, while it is especially efficient for codes with low 
information rate. We provide complexity estimates for several cases of interest. 

Index Terms 

Distance distribution, minimum distance, weight distribution, minimum weight, non-linear code 


E. Bellini, Telsy S.p.A., Italy, 
e-mail: eemanuele.bellini@gmail.com. 
M. Sala, University of Trento, Italy, 
email: maxsalacodes@gmail.com 


JOURNAL OF UTeX CLASS FILES, VOL. ??, NO. ?, MONTH YEAR 


1 


A deterministic algorithm for the distance and 
weight distribution of binary nonlinear codes 


I. Introduction 

Let C be a nonlinear code, that is, a code which is not necessarily linear. There are some related computational problems 
which are of interest, that we list as the computation of; the distance distribution (A), the minimum distance (Al), a minimum- 
distance codeword-pair (A2), the weight distribution (B), the minimum weight (Bl), a minimum-weight codeword (B2). The 
decoding performance of C can be established by solving Problem A and can be estimated by solving Problem Al. 

Remark 1: Solving Problem A2 (respectively, B2) implies solving Problem Al (Bl), but the converse does not hold. However, 
it is noteworthy that no known algorithm is able to solve Al (Bl) without solving A2 (B2). 

If C is linear. Problem A (respectively, Al, A2) and B (Bl, B2) are equivalent. This holds also for some nonlinear codes, 
called distance-invariant codes [Mit89], and many of these are optimal codes (e.g. the Preparata-Kerdock codes [Pre68]). When 
C is linear, we consider also the decoding problem, which is implied by solving Problem B2 in the suitable code coset (which 
is a nonlinear code). Observe that the considerations in Remark 1 remain valid also if we restrict to linear codes. 

In the linear case it is convenient to use probabilistic algorithms for the computation of the minimum distance, such as the 
Brouwer-Zimmerman algorithm [Zim96], or any of its variations, e.g. [CC98]. 

We note that these algorithms must actually retrieve (at least) one minimum-weight codeword in order to obtain the minimum- 
weight value. 

In the nonlinear case the minimum weight and the minimum distance may be different. For some classes of nonlinear codes 
there are algorithms which perform much better than brute force, e.g. code with large kernel ([PVZ12], [VZP14]) or additive 
codes ([WG06]). However, in the general nonlinear case it is not possible to improve significantly on the brute-force approach, 
as shown in [GOSIO]. Indeed, we are not aware of any non-exponential probabilistic or deterministic algorithm to solve any of 
the problems A, Al, A2, B, Bl, B2. In particular, to compute the weight distribution of a generic binary (n, 2^)-nonlinear code 
given as a list of binary vectors, we need to perform 0{n2^) bit operations, while finding the distance distribution requires 
0(n2^^) bit operations. 

The main result of this paper is a deterministic algorithm to compute the distance and weight distribution, and thus the 
minimum distance and the minimum weight, of any random binary code represented as a set of Boolean functions in numerical 
normal form (NNF). Our method performs better than brute force for those codes with low information rate and sparse NNF 
representation, while in the general case, it achieves the same asymptotic computational complexity as brute force methods. 
In Section II, after some preliminaries on Boolean functions, we argue that representing a code as a set of Boolean functions in 
NNF does not have any particular drawback with respect to the classical representation of a code as a set of binary vectors. In 
Section III, to each binary code we associate a polynomial whose evaluations are the weights of the code. Similarly, in Section 
IV, to each binary code we associate a polynomial whose evaluations are the distances of all possible pairs of codewords. Given 
these two polynomials we are able to compute the weight and the distance distribution of any binary nonlinear code. Finally, 
in Section V we provide some complexity considerations regarding our algorithms. In particular, we show that, to compute 
the weight distribution starting from the NNF representation of a binary nonlinear code has a complexity of 0{{n/h -f fc)2^), 
where n/h is the average number of nonzero monomials of the Boolean functions representing the code. Moreover, there are 
many important cases where our approach is provably faster than brute-force (e.g. in the linear case and in the nonlinear case 
when the NNF representation of the code is sparse), and cases where it is experimentally faster than the Brouwer-Zimmerman 
method. 


II. Preliminaries 

A. Representations of Boolean functions 

In this section we briefly summarize some definitions and known results from [CarlO] and [MS77], concerning representations 
of Boolean functions. 

We denote by F the field F 2 . The set F" is the set of all binary vectors of length n, viewed as an F-vector space. 

A Boolean function (B.f. ) is a function / : F" —F. The set of all Boolean functions from F" to F will be denoted by Bn- 
There are several ways one can uniquely represent a B.f. . We briefly outline those we need. 

1) Evaluation vector: We assume to have ordered F", so that F" = {pj^,..., P 2 »}. A Boolean function / can be specified 
by a truth table, which gives the evaluation of / at all p^’s. We consider the evaluation map: 

6„^F2” /^/ = (/(pi),...,/(p2„)). 

The vector / is called the evaluation vector of /. Once the order on F” is chosen, i.e. the pLs are fixed, it is clear that the 
evaluation vector of / identifies /. 
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2) Algebraic normal form: A Boolean function f € Bn can be expressed in a unique way as a square-free polynomial in 
¥[X] = F[a;i,... i.e. 

/ = E ’ 

where X'" = ■ ■ ■ x""". 

This representation is called the Algebraic Normal Form (ANF). 

There exists a simple divide-and-conquer butterfly algorithm ([CarlO], p.lO) to compute the ANF from the truth-table (or 
vice-versa) of a Boolean function, which requires 0(n2”) bit sums (with big O constant 1/2), while 0(2") bits must be 
stored. This algorithm is known as the fast Mobius transform. 

3) Numerical normal form: In [CG99] (see also [CGOl], [Car02]) the following representation of Boolean functions has 
been introduced. 

Let / be a function on F" taking values in a field K. We call the numerical normal form (NNF) of / the following expression 
of / as a polynomial: 

n 

/(xi,...,x„)= y] A,(na:r)= E 

ugF" i=l ueF” 

with Au G K and u= (ui,..., Un). 

It can be proved ([CG99], Proposition 1) that any Boolean function / admits a unique numerical normal form. As for the 
ANF, it is possible to compute the NNF of a Boolean function from its truth table by mean of an algorithm similar to a fast 
Fourier transform, thus requiring 0(n2") additions over K and storing 0(2") elements of K. 

From now on let K = Q. 

The truth table of / can be recovered from its NNF by the formula 

/(u) = Ev,VMeF", 

where a <u Vi G {1,..., n} < tti. Conversely, as shown in [CG99] (Section 3.1), it is possible to derive an explicit 
formula for the coefficients of the NNF by means of the truth table of /. 

Proposition 1: Let / be any integer-valued function on F". For every u G F", the coefficient A„ of the monomial in 
the NNF of / is: 

A„ = (-!)-(") y] (_l)w(a)j(„)_ (1) 

It is possible to convert a Boolean function from NNF to ANF simply by reducing its coefficients modulo 2. The inverse 
process is less trivial. One can either apply Proposition 1 to the evaluation vector of / or apply recursively the fact that 

a+wb = a+zb+z (-2a6), (2) 

and the fact that each variable has to be square-free (we are working in the affine algebra ]K[a;i, • • • , x„]/ {xl—xi, • • • , x„)). 

B. Representing a code as a set of Boolean functions 

We consider binary codes, i.e. codes over the finite field F of length n, with M codewords. A binary code C with such 
parameters is denoted as a (n, M)-code. If the code is a subspace of dimension k of (F)" then it is called linear and we 
indicate it as a [n,fc]-linear code. 

Now we show that any binary (n, 2^)-code C with 2* codewords can be represented in a unique way as a set of n Boolean 
functions /i,..., /„ : (F)^ -5- F. We indicate with a Boolean function represented in algebraic normal form, and with 
a Boolean function represented in numerical normal form. 

Definition 1: Given a binary (n, 2*)-code C, consider a fixed order of the codewords of C and of the vectors of (F)^. 
Then consider the matrix M whose rows are the codewords of C. We call the defining polynomials of the code C the set 
Xc = {fi, ■ ■ ■, fn} of the uniquely determined Boolean functions whose truth table are the columns of M. We also indicate 
with F = (/i,..., fn) G F[2f]", where X = xi,, Xk, the polynomial vector whose components are the defining polynomials 
of C. With abuse of notation, we sometimes write 

= {/f\ , /fH or Fc = {/f\ ..., /f)} 

Notice that F can be seen as an encoding function, since F : (F)* —>■ (F)". 
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1) Memory cost of representing a code: Let us call vectorial the representation of a code as a list of vectors over F, and 
Boolean the representation of the same code as a list of Boolean functions. 

For a random code, in terms of memory cost, the two representations are equivalent. In the vectorial representation we need to 
store all the components of each codeword, which are n times 2^ codewords. In the Boolean representation we need to store 
the 2^ coefficients of the n defining polynomials. In both cases we need a memory space of order 0(n2^). 

If the code C is linear it can be represented with a binary generator matrix of size fc x n. In this case the defining polynomials 
are linear Boolean functions, i.e. any is of the form G F, which means that to represent them it is sufficient to 

Store kn elements of F, yielding again an equivalent representation. 

As shown in [PVZ12], [VZP14], if C is a binary code of length n with kernel K of dimension kx and t coset leaders given 
by the set S = {ci,... ,ct}, we can represent it as the kernel K plus the coset leaders S. Since the kernel needs a memory 
space of order 0{nkK), then the kernel plus the t coset leaders takes up a memory space of order 0{n{kK + t))- When C is 
linear then C = ker(C'), so the generator matrix is used to represent C. On the other hand, when f +1 = \C\, then representing 
the code as the kernel plus the coset leaders requires a memory of 0{n\C\) = 0(n2^) (since we are supposing the code has 
2^ codewords). In the latter case, a Boolean representation could be more convenient. Another situation in which a Boolean 
representation is more convenient is the case where the dimension k of the code is much less than the length n, i.e. when 
certain components have to be repeated. 

It is worth noticing that a linear structure of a nonlinear binary code can be found over a different ring. For example there 
are binary codes which have a Z 4 -linear or Z 2 Z 4 -linear structure and, therefore, they can also be compactly represented using 
quaternary generator matrix, as shown in [HKC"*'94] and [BFCP+10]. 

It can be shown that representing a code with “practical” parameters and using NNF B.f. ’s is as convenient as the usual 
representation of the code. 

2 ) Number of coefficients of the NNF: In order to prove that representing a code with practical parameters and using NNF 
B.f. ’s is as convenient as the usual representation of the code, in this section we want to study the distribution of the number 
of nonzero coefficients of a B.f. represented in NNF, i.e., once the number of variables k is fixed we want to know how many 
B.f. ’s have only one nonzero coefficient, how many have two, and so on. 

We are also interested in finding a relation between this distribution and the distribution of the number of nonzero coefficients 
of a B.f. represented in ANF. 

In Table I we report the distribution of the nonzero coefficients of B.f. ’s represented in ANF and NNF with k = 1,2, 3,4 
variables. As one may expect, the ANF follows a binomial distribution. This means that choosing a random B.f. its ANF is 
likely to have half of the coefficients equal to 0 and half equal to 1. This does not happen for the NNF, although for k small 
the two distributions are close. This means that, when k is small, a random binary (n,2'= )-nonlinear code can be represented 
with a set of B.f. ’s in NNF with half of the coefficients equal to 0 with high probability, while sparse NNF representations 
are more rare as k grows. 


k 0 1 

A: 1 12 

N: 1 12 

A; 2 1 4 

N: 2 I 4 

A: 3 I 8 

N: 3 1 8 

A: 4 1 16 

N: 4 1 16 


2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 

1 .... . . . ....... 

1 . . . . . . . ....... 

6 4 1 - - - - - 

542 - - - - - - 


28 

56 

70 

56 

28 

8 

I 


- 



- 



19 

42 

59 

50 

34 

28 

15 


- 



- 



120 

560 

1820 

4368 

8008 

11440 

12870 

11440 

8008 

4368 

1820 

560 

120 

16 I 

65 

304 

840 

1768 

3250 

5458 

8077 

9986 

9819 

7948 

5954 

4458 

3193 

2830 1569 


Table I 

Distribution of the nonzero coefeicients in the ANF and NNF. 


Proposition 2: Let / be a B.f. in k variables. Let f^^'i and be respectively the ANF and the NNF of /. Then if f^^'> 
is a polynomials with r < 2^ nonzero coefficients, then is a polynomial with no more than min{2^,2’' — 1} nonzero 
coefficients. 

Proof 1: When computing the NNF from the ANF we have again the r initial terms of the ANF, plus terms which are 
all possible double product of the r initial terms, plus, in general, (Q terms which are all possible i-product of the r initial 
terms, for each i G {1,..., r}. Thus we will have 


i=i 

terms to be summed together. If no sum of similar monomials becomes zero than we have 2’’ — 1 nonzero terms. 

By Proposition 2, if we want a NNF with no more than s terms then we have to choose the ANF with no more than 
r — log 2 (s + 1) terms. 

Proposition 3: Let / be a linear B.f. in k variables. Let f^^'> and be respectively the ANF and the NNF of /. Thus, 

for ii < i 2 < ■ ■. < ir,r < k, 

= Xi^ + . . . + Xi^ , 



= 2 ’'-! 


( 3 ) 
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for r < k. Then is a polynomial with exactly 2^ — 1 nonzero coefficients: 


E (- 1 ) 


W(l7) —1 


•uC{F)’' 

v—{vi,...,vh)^0 


w{v) — 1 


Vi Vr 


Proof 2: Directly from Proposition 1. 

Proposition 3 shows that for a linear B.f., its NNF representation is much denser than its ANF representation. 


III. Finding the codewords with weight exactly t 

It is possible to construct a polynomial with integer coefficients whose evaluations in {0,1}^ C if are the weights of the 
codewords of the code C. 

Definition 2: Let X = {xi,... ,Xk], and — X = {x^ — xi,... ,x\ — Xk}- We call the weight polynomial of the code 
C the polynomial 

n 

tvcix) = ^ /f )(x) e Z[x]/{x2 - X), 

where the are the defining polynomials of the code C in NNF. 

Theorem 1: Let v G {0,1}^ C if. Then there exists a codeword c G C such that w(c) = rDc(u). 

Proof 3: It is sufficient to note that Wc G C,c = {f[^\p ),..., f^\P)) for some P G {0,1}^, and that the sum of all fj^^ 
(y) 

is over the integers, with f^ ' {P) > 0, for i = 1,... ,n. 

Once we have the weight polynomial toc of the code C, not only we can find the minimum weight of C, but we also find 
which are the codewords having certain weights by looking at its evaluation vector over the set {0,1}^. As we will see in 
Section V-D, computing this evaluation has a cost of 0{k2^). The complexity maintains the same order if the number of terms 
of each defining polynomial in NNF is on average 0(^2^). 

We summarize in Algorithm 1 the steps to obtain the weight distribution of a binary (n, 2*)-code C given as a list of 2^ 
codewords (and thus also the minimum weight of C), by finding the evaluation vector of the weight polynomial tnc- We 
indicate with Cij the j-th component of the i-th word of C, with 1 < j < n and 1 < i < 2*. 

Algorithm 1 To find the weight distribution of a binary nonlinear code C. 

Require: ci,..., C 2 fc G C 
Ensure: the evaluation vector of tnc 
I: G- NNF of the binary vector {Cij,..., C 2 i=j) for 1 < j < n 

2: tuc ^ /f ^ + . . . + /f ^ 

3: Me Evaluation of toe over {0,1}* 

4: return 


IV. Finding pairs of codewords with distance exactly t 
It is straightforward to adapt the techniques in Section III to the computation of the distance distribution of a code C. 
First, we show how to construct a polynomial with integer coefficients whose evaluations in {0,1}^^ C are the distances 
of all possible pairs of codewords of the code C. 

Definition 3: Let X = xi,..., Xk, X = x \,..., Xk, and X^ — X = x\ —x\, ■ ■ ■ ,xj. —Xk, X^ — X = xf —xi,..., Xk^ —Xk- 
We call the distance polynomial of the code C the polynomial 

n 

c)cW = E(/f^w-/f^w)' 

G Z[X, X]/(X2 - X, X^ - X), 

(Z) 

where the s are the defining polynomials of the code C in NNF. 

Notice that the squaring operation does not introduce squared variables in the expression of hc", because we are working in 
the quotient ring Z[X, X]/(X^ — X, X^ — X). 

Notice also that, for v = (ui,..., Vk,Vk+i, ■ ■ ■, V 2 k) G {0,1}^^, we have that 0c'((^'i5 ■ ■ ■ ,Vk, Vk+i, ■ ■ ■, V 2 k)) = 0 if and only 
if Vi = Vk+i for i = 1,... ,fc, and that c)c((vi, • ■ .,Vk,Vk+i,. ■ • ,W 2 fc)) = 0c((t^fc+i, ■ • ■,V 2 k,vi,.. .,Vk)). 

Theorem 2: Let v G {0,1}^* C Z^* such that (vi,..., Vk) f (ufc+i, ..., V 2 k)- Then there exists a pair of distinct codewords 
ci,C 2 G C such that d(ci,C 2 ) = 0c(^’)- 
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Proof 4: Note that Vci,C 2 C 2 we have that C 1 -C 2 = • ■ •, ifn^HP) - € {0,1}”, 

for some P,Q€ {0,1}*, P f Q. The squaring operation is needed in order to correct those components which have become 
a —1 after the subtraction operation. Finally, the sum of all is over the integers. 

We summarize in Algorithm 2 the steps to obtain the distance distribution of a binary (n, 2^)-code C given as a list of 2^ 
codewords (and thus also the minimum distance of C), by finding the evaluation vector of the distance polynomial Oc- We 
indicate with Cij the j-th component of the i-th word of C, with 1 < j < n and 1 < i < 2*. 

Algorithm 2 To find the distance distribution he of a binary nonlinear code C. 

Require: ci,..., 02 ^ G C 
Ensure: the evaluation vector of he 
1: ■(— NNF of the binary vector ..., (^ 2 * j) for 1 < j < n 

2 : he ^ (/f ^(A) - f[^\x)f + ... + Un\x) - /f ^(A))2 
3: he Evaluation of he over {0,1}^^ 

4: return h(^ 


V. Complexity considerations 

First of all let us notice that given a binary (n, 2^)-code as a list of 2^ codewords, to find the weight distribution of a binary 
nonlinear code C using brute force requires n2* bit operations, since we have to check each component of each codeword of 
C. Similarly, to find the distance distribution, n2^^ operations are needed. 

We note that the operations involved in our following complexity estimates are over the integers, but the size of the integers 
involved in our operations is limited by 2*, and they have a sparse binary representation in the random case (they are sparse 
sums of powers of 2). 

We now analyze the complexity of Steps 1, 2, and 3 of Algorithm 1 and 2. Then, due to the similarities of the two algorithms, 
we only concentrate on the first one. We compare our method to compute the minimum weight of a binary code with brute 
force and, in the linear case, with the Brouwer-Zimmerman method ([Zim96]). We provide more emphasis on the comparison 
in the linear case, since no other methods than brute force are known in the nonlinear case, (with the exception of [PVZ12], 
[VZP14]). 

A. From list of codewords to defining polynomials in NNF 

Proposition 4: The overall worst-case complexity of determining the coefficients of the n defining polynomials in NNF of 
the code C given as a list of vectors is 0{nk2^). 

Proof 5: We want to find the NNF of the Boolean function whose truth table is given by a column of the binary matrix 
whose rows are the codewords of the code C. In [CG99, Proposition 2] it is shown that to compute the NNF of a Boolean 
function in k variables given its truth table requires k2^~^ integer subtractions. Since we have to compute the NNF for n 
columns the overall complexity is 0{nk2^). 

B. From defining polynomials to weight polynomial 

Proposition 5: The overall worst-case complexity of summing together all the defining polynomials in NNF is 0(n2^). 
Proof 6: Each monomial m in a defining polynomial is square-free, and since m G Z[a;i,..., Xk], then a defining polynomial 
can have no more than 2^ monomials. Since the defining polynomials are n, the proposition follows. 

Remark 2: Clearly, the computational complexity of this steps decreases if the defining polynomials are sparse when 
considering their NNF. 

C. From defining polynomials to distance polynomial 

Proposition 6: The overall worst-case complexity of Step 2 of Algorithm 2 is 0{n2'^^). 

Proof 7: The sum = fji'^\x) — f^'^\x) for z = 1,..., n is just a concatenation of coefficients, where the coefficients of 
fl'^\x) need to have their sign switched. 

The polynomial obtained has 2^+^ terms in the worst case, and squaring it requires integer multiplications and the 

same number of integer sums, for a total of 2^^+^ integer operations. Since we have n such polynomials fi, to compute their 
square requires n2^*+^ integer operations. Each fi has at most 2^^ terms, since fi G Z[A, A]/(A^ — X, — A). Summing 
all fi together thus requires at most n2^^ integer sums. The overall worst-case complexity of Step 2 of Algorithm 2 is then 

n22'=+3 + n2^^ = n2^'^{2^ + 1). 

Remark 3: Again, the complexity of this step is lower if the defining polynomials are sparse in their NNF. If, for example, 
the nonzero coefficients of f^'^\x) are ~ k, so are the coefficients of f^'^\x), and the squaring of fi requires ^ (2kfi integer 
operations. 
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D. Evaluation of the weight and the distance polynomial 

Algorithm 3 describes the fast Mobius transform to compute the evaluation vector of a Boolean function / in NNF in k 
variables. 

We use the following notation; the coefficient C 2 fe is the coefficient of the greatest monomial, i.e. of xi ■ ■ ■ Xk, C 2 fe_i the 
coefficient of the second greatest monomial, and so on until ci, which is the costant term. We provide Example 1 to clarify 
our notation. 

Notice that the sum in Step 6 is over our integers. If it was a sum in F then we would obtain the truth table of /. 

Algorithm 3 Fast Mobius transform for fast integer polynomial evaluation. 

Require: vector of coefficients c = (ci,..., C 2 fc) 

Ensure: evaluation vector e = (ei,..., 62 *) 

1 : e c 

2 : for i = 0,..., fc do 
3: & 0 

4: repeat 

5: for x = 6 + 2* — 1 do 

6: ea;_|_i_|_ 2 i ^ Gx+l + &x+l+2* 

1: 5^6 + 2*+^ 

8: until b = 2^ 

9: return e 

Example 1: Consider k = 3 and lexicographical ordering with xi y X 2 >- x^. Let / = 8 a;iX 2 X 3 + 3xi + 2. Then 
c = (ci,..., eg) = (2, 0,0,0, 3,0, 0, 8 ) and e = (ei,..., eg) = (2, 2, 2, 2, 5, 5, 5,13). 

Proposition 7: Evaluating the weight polynomial over the set {0,1}* has a computational cost of 0(k2^). 

Proof 8: This is the cost of Algorithm 3, i.e. k2^~^ integer sums. 

Similarly 

Proposition 8: Evaluating the distance polynomial over the set {0,1}^^ has a computational cost of 0{k2^^). 

E. Comparison with brute-force method 

Because of the similarities of Algorithms 1 and 2, we now concentrate our analysis only on Algorithm 1. All considerations 
we expose can be easily extended for Algorithm 2. 

Theorem 3: Let ft, be a positive integer. If the code C is given as a set of B.f. ’s whose NNF have on average 2^/ft coefficients 
different from 0, then computing the minimum weight of C requires at most 



Proof 9: By Proposition 8 computing the evaluation vector of the weight polynomial ruc requires k2^ ^ integer sums using 

(zT 

the fast Mobius transform. To compute the weight polynomial we need to sum the n defining polynomials f^ ,i = 1,... ,n, 
in NNF. If each of these polynomials has on average 2^/ft coefficients then the complexity of computing toe requires O(n^) 
integer sums. So the final complexity is at most (n/ft)2^ + k2^~^. 

Remark 4: Our method is more efficient than brute force when n/h-\- k < n. This is very likely to happen for a random 
code of low information rate where ft <C n. If ft ~ n and the NNF is dense, then it is convenient to use brute force rather 
than our method. 

Notice also that if the sets of nonzero monomials of two polynomials in NNF are disjoint, then the sum of the two polynomials 
is simply their concatenation. So, if the defining polynomials of a code are “disjoint”, then the cost of computing the weight 
polynomial is 0 ( 1 ), and the final cost of finding the minimum weight becomes the cost of computing the evaluation of toc, 
i.e. 0 (ft 2 '=-i). 

Fact 1 shows that, for n ^ ft, when the code is linear our method to compute the minimum nonzero weight (i.e. the distance 
of the code) given the set of the defining polynomials in NNF is more efficient than the classical method which uses brute 
force, given the list of the codewords of the code. 

Fact 1 (Comparison with brute force, linear case, n ^ 2^): Consider a random binary [n, ft]-linear code C such that n ~ 2^. 
Then computing the weight distribution of C 

1 ) given the list of its codewords and using brute force requires 0 ( 2 ^^). 

2) given the list of the defining polynomials in NNF and finding the minimum of toe requires 0 ( 22 *). 

Proof 10: The complexity of finding the weight distribution of C in case 1 is 0(n2*) = 0(2^*), since n 2*. 

The complexity of finding the weight distribution of C in case 2 is 0((n/ft + ft)2*) (by Theorem 3), where n/h is the average 
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number of nonzero coefficients of the NNR If the linear code C is random, then so are the random linear defining polynomials. 
A random linear function in k variables has on average k/2 nonzero coefficient in ANF and thus 2^/2 

— 1 nonzero coefficients 

in NNF , i.e. n//r ~ 2'=/^, and 

0{{n/h + k)2^) = + k)2^) = 0(2^'=). 

Fact 2 (Comparison with brute force, nonlinear case, n ~ 2^j; Consider a random binary (n, 2^)-nonlinear code C such that 
n ~ 2’=, and whose defining polynomials have on average k/2 nonzero coefficients in the ANF. Then computing the weight 
distribution of C given the list of the defining polynomials in NNF and finding the minimum of tuc requires 0(22^). 

Proof 11: The arguments are the same as in the proof of Fact 1, except that this time the nonzero coefficients of the NNF 
are less than 2^/^ — 1. This implies that in practice the overall complexity in this case is even lower, as shown in Table IT 
In Table II we show the coefficient of growth of the complexity of our method in three different cases. The first line shows 
the coefficient of growth of the brute force method applied to a linear code. The second line shows the coefficient of growth 
of our method applied to a linear code. In the third line our method is applied to a nonlinear code whose ANF representation 
is sparse, and in the last line nonlinear codes with dense ANF representation are considered. 

For the comparison we choose for each k, 10 random (2^, 2*)-codes and 10 random (2*+^, 2*+^)-codes and compute the 
average times ti,t 2 to compute the minimum weight in each case. Then we report the number log 2 (fi/f 2 )- 
We can see, as expected, that our method performs best in the case of sparse nonlinear ANF. 


k 

00 

1 

to 

9-10 

10 - 11 

11 - 12 

Brute-force Linear ANF 

1.93 

1.98 

2.00 

1.99 

Linear ANF 

1.32 

1.38 

1.53 

1.61 

Sparse Nonlinear ANF 

0.89 

1.12 

1.32 

1.38 

Dense Nonlinear ANF 

2.09 

2.03 

2.04 

2.08 


Table II 




Coefficients of growth of our method compared with brute force. 


F. Comparison with Brouwer-Zimmerman method for linear codes 

In the linear case the defining polynomials of a code C clearly have a sparse ANF. If a defining polynomial in F[a;i,..., Xk] 
is linear and with less than k variables, than many coefficients of the NNF are 0, precisely, the coefficients of the monomials 
containing the missing variable in the ANF. In this case the computation of the weight distribution of C (and thus of the 
distance of C, since the code is linear) is faster than brute force. 

In Table III we compare the time ti needed to compute the minimum weight w of a linear code given as list of codewords 
with the MAGMA command 


MinimumWeight(CiMethod:="Zimmerman"), 

with the time t 2 needed to compute w when the code is given as a list of B.f. ’s in NNF using our method. The comparison 
has been done for 10 random linear codes fixing a pair (fc,n), with n ^ A:. In the column Wav the average minimum weight 
found is shown. 

An AMD E2-1800 APU processor with 850 MHz has been used for the computations. We can see that there are cases, i.e. 


k 

n 

tl 

t2 

ti/G 

"^av 

8 

lOOfe = 800 

0.043 

0.007 

6.143 

360.1 

8 

150fc = 1200 

0.122 

0.012 

10.17 

554.1 

8 

200fc = 1600 

0.122 

0.015 

8.13 

745.2 

8 

250fc = 2000 

0.171 

0.011 

15.55 

935.0 

9 

lOOfe = 900 

0.833 

0.019 

4.368 

403.1 

9 

150fe = 1350 

0.116 

0.020 

5.800 

615.6 

9 

200fc = 1800 

0.277 

0.024 

11.54 

834.0 

9 

250fc = 2250 

0.256 

0.029 

8.828 

1050.0 

10 

lOOfe = 1000 

0.050 

0.031 

1.613 

448.3 

10 

150fe = 1500 

0.136 

0.041 

3.317 

687.5 

10 

200fe = 2000 

0.178 

0.050 

3.560 

922.7 

10 

250fe = 2500 

0.185 

0.056 

3.304 

1168.3 


Table III 

Comparison with Brouwer-Zimmerman method. 


{k,n) = (8,1200) or (fc,n) = (9,1800), where our method is 10 times faster than the Brouwer-Zimmerman method. This 
is not surprising, since the it is known that there are cases where brute force performs better than the Brouwer-Zimmerman 
method. 

We also recall that Brouwer-Zimmerman method is probabilistic, while our method is deterministic. 
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VI. Binary codes whose cardinality is not a power of 2 

Algorithm 1 can be modified to work also with binary codes whose cardinality is not a power of 2. We only mention two 
techniques that can be used. 

A first method consist in expanding the code until it reaches a size of 2^. The key observation is that the minimum weight 
vector of a list of vectors in (F)” (i.e. the codewords of C) is equal to the minimum weight vector of the same list concatenated 
to the list of some repeated words of C (eventhough this new list is not a code anymore). 

A second approach is to divide the code C in subcodes whose cardinality is a power of 2. Then to each of these codes we 
can apply Algorithm 1 and then take the minimum of all the results. See [Bell4] for details. 
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